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Description 

A portion of the disclosure of this patent document contains material which is subject to copyright protection. The 
copyright owner does not object to the reproduction by anyone of the patent document or the patent disclosure, as it 
5 appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatso- 
ever. 

RELATED APPLICATIONS 

10 This application is related to United States Serial No. (Beckman Docket No. 1 28D-1 22) entitled 

"Streptolysin O Varients" by Craig W. Adams, and United States Serial No. (Beckman Docket No. 

128D-123) entitled "Antibodies to Streptolysin O Derivatives and Varients", by Craig W. Adams and Patty Pang. Both 
applications are being filed simultaneously herewith, and both are incorporated herein by reference. 

15 FIELD OF THE INVENTION 

The present invention is generally related to Streptolysin O and more particularly to Streptolysin O derivatives 
produced by recombinant DNA technology. 

20 BACKGROUND OF THE INVENTION 

Disclosed herein is a derivative fusion product of the antigenic substance, Streptolysin O. Streptolysin O is asso- 
ciated in humans with, for example, rheumatic fever, such that immunodiagnostic assays for evidence of immunological 
response against Streptolysin O are routinely utilized. The disclosed derivative version of Streptolysin O is produced 
25 by recombinant DNA techniques, is soluble upon expression, is capable of being bound by at least one antibody capable 
of binding to wild type SLO, and is hemolytically active. Prior to this invention, Streptolysin O could be obtained via the 
bacteria Streptococcus pyogenes. The toxic and pathogenic properties of Streptolysin O are typically monitored by the 
lysis of red blood cells. 

30 I. The Genetic Code 

The genetic code for a particular protein, such as Streptolysin O (hereinafter "SLO"), depends upon the sequential 
grouping of three nucleotides, referred to as a "codon," and the arrangement of such codons in relationship to each 
other. 

35 A "nucleotide" consists of a nucleoside and one or more phosphate groups. A "nucleoside" consists of a nitroge- 

neous base linked to a pentose sugar. A "pentose" sugar comprises five carbon atoms. In a molecule of deoxyribonu- 
cleic acid, or "DNA", the pentose sugar is "deoxyribose," and the nitrogeneous base can be adenine ("A"), guanine 
("G"), thymine ("T") or cytosine ("C"). In a molecule of ribonucleic acid, or "RNA", the pentose sugar is "ribose", and 
the nitrogeneous bases are the same for DNA, except uracil ("U") replaces thymine. Three types of RNA, messenger 

40 RNA, or "mRNA", transfer RNA, or "tRNA", and ribosomal, or "rRNA", translate the genetic information encoded in the 
DNA into, e.g., a polypeptide or a protein. Thus, genetic information is generally transferred as follows: DNA->RNA->- 
protein. 

The sequence of the nitrogenous bases of the DNA molecule encodes the genetic information contained in that 
molecule. The sugar and phosphate groups of the DNA molecule perform a structural role, forming the backbone of a 

45 series of DNA molecules, referred to as a DNA "macromolecule." DNA consists of two complementary strands of 
nucleotide chains, and these strands are held together by (relatively) weak hydrogen bonds. The bases of each DNA 
molecule bind to each other: A always bonds with T and C always bonds with G. Thus, the sequence 5'-ATCG-3' of a 
first strand lies immediately opposite a complementary sequence 5'-TAGC-3' on the other strand. This is referred to 
as "complementary base pairing." The process of complementary base pairing is referred to as "hybridization" and 

50 results in the formation of a stable DNA macromolecule. 

Each codon specifies one amino acid. "Amino acids" are the principal components of proteins, and "proteins" are 
the essential constituents of all living cells. There are 20 natural amino acids. Because there are four nucleotide bases 
(A, C, G and T) and three nucleotides per codon, there are 64 possible codons (4 3 ). Accordingly, because there are 
only 20 natural amino acids, most amino acids are specified by more than one codon. This is referred to as "redundancy" 

55 or "degeneracy". For example, the codons GCG, GCA, GCT and GCC all encode for the amino acid Alanine. 

The codon ATG (Met amino acid codon) is the normal "start" codon. The codons TAA, TAG and TGA, which do 
not encode for amino acids, are the normal "stop" codons. The formation of mRNA is established based upon the start 
codon of one strand of the double stranded DNA macromolecule such that the resulting single stranded mRNA will 
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have a nucleotide sequence complementary to the sequence of a single strand of the DNA. When a stop codon is 
reached by the mRNA along the DNA molecule, transcription is stopped. 

The regions along the DNA macromolecule which are translated from the mRNA are referred to as "exons" for 
eukaryotes, and "translated regions" for prokaryotes. "Genes" include exons (eukaryotes) and translated regions 
s (prokaryotes). Thus, genes encode for proteins and/or polypeptides. Mammals, for example, are eukaryotes; bacteria, 
for example, are prokaryotes. 

The natural synthesis of protein takes place over a series of several steps. The first step is the formation of an 
mRNA macromolecule complementary to the DNA macromolecule, as noted above. Thereafter, tRNA is manufactured; 
the tRNA provides a complementary codon ("anti-codon") for each codon on the mRNA macromolecule. Thereafter, 
10 rRNA will catalyze the assembly of the codon-specific amino acids resulting from the mRNA:tRNA into proteins and/ 
or polypeptides. 

II. Recombinant DNA Technology 

15 Most proteins are produced naturally in extremely small quantities. The advent of recombinant DNA technology 

has allowed for the production of large quantities of proteins that were previously only available in such small quantities. 

The following describes a "typical" genetic manipulation as it might apply to Escherichia coli, a typical bacterial 
host used for cloning. 

In order to isolate, or "clone", a gene, a DNA library is constructed from a DNA sequence (referred to as a "genome") 

20 using vectors. A "vector" is a small circular molecule of double-stranded DNA that occurs naturally in bacteria, yeast 
and mammalian cells. Vectors generally comprise the following characteristics: (i) a DNA sequence encoding a se- 
lectable "marker" which assures that the vector will be maintained in an appropriate host cell (e.g., E. coli); (ii) a con- 
trollable transcriptional promoter - by "controllable" is meant that the promoter can be "switched on" by manipulation 
of, e.g. the environment of the vector; a "promoter" is a region of DNA sequence that when switched on produces large 

25 amounts of mRNA from the gene of interest inserted into the vector-different promoters (e.g., Jac, trg, tac, etc.) have 
different rates of mRNA production; (iii) translational control sequences, for example, an appropriately positioned ATG 
start codon; and (iv) a polylinker; a "polylinker" simplifies the insertion of the gene of interest in the correct orientation 
within the vector. Vectors can be engineered to provide restriction endonuclease sites on either side of an ATG start 
codon located on the vector such that the gene of interest can be inserted next to the start codon; this allows for 

30 immediate transcription of the gene upon activation of the promoter gene. 

A "restriction endonuclease" is an enzyme which cuts the double-stranded DNA at specified sequences of four to 
eight nucleotides in length, and many restriction endonucleases produce staggered cuts that leave a short, single- 
stranded tail at the location of the cut. This end is referred to as a "cohesive" or "sticky" end because it can form 
complementary base pairs with another sticky end. The genome is cleaved (cut-up) by a specified restriction endo- 

35 nuclease corresponding to the restriction endo-nuclease used to cut the vector, and the individual pieces of the cleaved 
genome are inserted into the vector. Randomly cleaving the entire genome of a cell with a specific restriction endo- 
nuclease is typically referred to as the "shotgun" approach to gene cloning. The shotgun approach can produce an 
extremely large number of DNA fragments, all of which are inserted into vectors. 

The individual pieces of the genome and the vectors, having corresponding sticky ends, are "fused" or "annealed" 

40 together to form circular hybrid DNA "plasmids" comprising a portion of the genome and the vector. 

The plasmids are then introduced into host cells. There are two types of host cells, "eukaryotic" and "prokaryotic". 
An example of a eukaryotic host cell is the Chinese hamster ovary ("CHO"); an example of a prokaryotic host cell is E. 
coli bacteria. For purposes of the discussion to follow, attention will focus on prokaryotic host cells. 

When the plasmids are introduced into the host cell, these cells are referred to as being "transformed" with the 

45 plasmids. As the cells grow and divide, the plasmids will similarly replicate to produce copies of the plasmids containing 
the DNA fragments. Each transformed cell is referred to as a "genomic DNA clone" and the entire collection of trans- 
formed cells containing all of the different DNA fragments is referred to as a "genomic DNA library". 

In order to determine which genomic DNA clones contain the DNA sequence capable of being copied into a cor- 
responding mRNA, it is necessary to separate or "screen" the genomic DNA clones. There are several ways to ac- 

50 complish this task including, for example, the use of radioactive DNA probes or evidence of immunoreactivity Screening 
can be an extremely labor intensive process because, as noted, the shotgun approach by definition leads to the for- 
mation of an extensive number of genomic DNA clones, which must be screened to find potential candidates of interest. 

III. Streptolysin O 

55 

Streptolysin O ("SLO") has an approximate molecular weight of between about 65,000 and about 70,000 daltons. 
SLO belongs to a class of oxygen sensitive ("thiol-activated"), cell destroying ("cytolytic") toxin ("cytotoxin") which are 
produced by gram-positive bacterial species belonging to four different genera (streptococcus, bacillus, Clostridium 



3 



EP 0 555 439 B1 



and listeria). 

SLO interacts with membrane cholesterol and exerts cytolytic-cytotoxic effects on a broad range of mammalian 
cells. Additionally SLO has very potent cardiotoxic properties. One of the toxic and pathogenic properties associated 
with SLO is its hemolytic activity i.e. SLO will lyse red blood cells, resulting in the release of hemoglobin. SLO can be 
s lethal to laboratory animals in relatively small doses. Injection of SLO into an animal typically results in its immediate 
death. 

Because SLO is produced by specified bacterial species, when these species "invade" a mammalian host, the 
SLO released by the bacteria is treated by the host as a foreign protein. SLO, then, is an antigen. "Antigens" are high 
molecular weight compounds which upon entry into the blood stream of a vertebrate stimulate the transformation of 

10 the small lymphocytes of the B-type into lymphoblasts. The lymphoblasts secrete antibodies specific to the antigen 
stimulator. The antibodies are proteins possessing reactive sites specifically complementary to a reactive feature or 
site on the stimulating antigen. Antibodies generally have the property of rendering the antigen harmless to the host 
organism by occupying the immunologically active sites, or "epitopes", on the antigen particles or molecules. Anti-SLO 
antibodies ("ASO") are therefore produced by the host in response to the secretion of SLO into the host. Approximately 

is 80-85% of individuals with current streptococcal infection or their sequelae (an after effect of a disease or injury) will 
demonstrate elevated levels of ASO. 

Determination of previous and/or current infection by the specified bacterial species which secretes SLO is possible 
using immunodiagnostic assaying techniques which, e.g., rely upon the hemolytic properties of SLO and the binding 
of ASO to SLO. Focusing on hemolytic immunodiagnostic assays for SLO, a patient sample is added to a known 

20 amount of SLO derived from a source other than the patient and this mixture is added to a known amount of red blood 
cells such as, for example, rabbit red blood cells. Because SLO has hemolytic properties, it will lyse these red blood 
cells. However, when ASO binds to SLO, the hemolytic properties of SLO are neutralized. Thus, if the sample is obtained 
from a patient having current streptococcal infection or their sequelae, there will be elevated levels of ASO in the 
sample. Accordingly, if the mixture results in high levels of hemolytic activity, this indicates that there is little, if any, 

25 ASO in the serum sample (and hence little, if any infection from the SLO secreting bacteria) because the known 
quantity of SLO in the mixture is capable of lysing the known quantity of red blood cells in the mixture. If the mixture 
does not lead to hemolytic activity, this is indicative of an amount of ASO in the sample sufficient to inactivate the known 
quantity of SLO in the mixture. Investigators refer to such an amount of ASO as a "titer". Typically, an ASO titer of 
greater than about 300 International Units/ml is indicative of infection by a bacterial source capable of secreting SLO. 

30 Other immunodiagniostic assays for determination of infection by SLO secreting bacteria include nephelometric and 
turbidimetric protocols. 

In order to utilize the immunodiagnostic assaying technique outlined above, it is necessary to have access to 
sufficient SLO to be added to the mixture. One source of SLO is culture broths containing the bacteria Streptococcus 
pyogenes ("S. pyogenes"). However, obtaining SLO in this manner is quite difficult and costly: for every liter of the S. 

35 pyogenes culture broth, only about 0.5mg of SLO can be expected; the typical media for growing S. pyogenes is ex- 
pensive; S. pyogenes is a class 2 pathogen; and SLO obtained in this manner contains many other antigenic materials. 
Additionally, SLO obtained by this procedure tends to be unstable in liquid form. Accordingly, such SLO preparations 
are most typically supplied as lyophilized powder in vials. Before use, the lyophilized powder must be reconstituted in 
a suitable solvent. Unfortunately such reconstituted SLO will rapidly lose its hemolytic activity and therefore it must 

40 be used within a brief period after reconstitution or discarded. This has one notable and negative consequence: it is 
usually impossible to test individual serum samples as soon as they are obtained. Thus, laboratories which conduct 
ASO assays based upon hemolytic activity typically store the individual samples until a sufficient number are collected 
to enable economic use of the lyophilized SLO. This can result in an inordinate delay in obtaining test results. 

ASO assays which rely upon nephelometric or turbidimetric protocols need significant amounts of purified SLO. 

45 Because of the costs associated with obtaining significant quantities of purified SLO from S. pyogenes is expensive, 
the foregoing hemolytic based assay was the first ASO assay to become commercially available. 

Recombinant DNA techniques for obtaining SLO fusion products offer the benefit of obtaining relatively large quan- 
tities of such products. Using such technology, it would be possible to avoid the tedious and cost-ineffective aspects 
of obtaining SLO from S. pyogenes. As used herein, the term "SLO derivative" is an SLO fusion product which is 

50 soluble, hemolytically active and which is capable of being bound by at least one antibody to wild type SLO. SLO 
derivatives are designated herein as "rSLO". These SLO derivatives are provided in large quantities, are substantially 
pure, and maintain hemolytic activity. 

Such SLO derivatives would be beneficial, e.g., in immunodiagnostic assays which rely upon, e.g, the hemolytic 
properties of wild-type SLO. 

55 

SUMMARY OF THE INVENTION 

The present invention provides SLO variants. These variants, designated herein as "mSLO", comprise the following 
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characteristics and are broadly defined thereby: (i) recognized by wild-type anti-streptolysin O antibodies (ASO), i.e. 
comprising at least one epitope characteristic of wild-type Streptolysin O; and (ii) substantially non-hemolytic activity. 
As used herein, the term "recognized" means capable of binding to at least one epitopic site on mSLO; the phrase 
"substantially non-hemolytic activity" means a percent wild-type SLO specific activity of less than about 75% based 
$ upon a wild-type SLO specific activity of 4x1 0 5 hemolytic units/mg wild-type SLO; and "wild-type SLO" is accorded the 
usual definition associated with such a phrase, i.e., SLO that is naturally secreted by a capable bacterial source. "Wild- 
type SLO", by definition, does not include, e.g., SLO fusion products derived via recombinant DNA techniques. 

A particularly useful rSLO in accordance with the present disclosure is designated herein as rSLO.3 having a 
specific hemolytic activity of about 3.6 x 10 4 hemolytic units ("HU") per mg. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a single strand of the nucleic acid sequence of a most preferred embodiment of an SLO derivative, des- 
ignated rSLO.3; and 

15 

Fig. 2 is the amino acid sequence of rSLO.3. 
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

20 As used in this disclosure, Streptolysin O derivatives, or "rSLO", comprise the following characteristics and are 

broadly defined thereby: (i) capable of being bound by at least one antibody to wild-type SLO; (ii) soluble upon expres- 
sion; and (iii) hemolytically active. As used herein, the phrase "wild-type SLO" is accorded the usual definition asso- 
ciated with such phrase, i.e., SLO that is naturally secreted by a bacterial source capable of secreting such protein. 
Preferably, the hemolytic activity of rSLO has about 75% of the hemolytic activity of wild-type SLO, based upon a wild- 

25 type SLO specific activity of 4x1 0 5 hemolytic units/mg. More preferably, the hemolytic activity of rSLO is between about 
5% and 50%, and most preferably about 9%, of the hemolytic activity of wild-type SLO, based upon a wild-type SLO 
specific activity of 4x1 0 5 hemolytic units/mg wild-type SLO. These values are relative; thus, if percent wild-type SLO 
specific activity is based upon a wild-type SLO specific activity of 1x10 6 hemolytic units/mg, the above values are 
decreased by a factor of 2.5 (i.e., 75% becomes 30%; 9% becomes 3.6%; etc). 

30 The preceding is detailed because the "specific activity" of wild-type SLO has been described as being as high as 

about 1x10 6 hemolytic units/mg, although specific activity of about 4x1 0 5 hemolytic units/mg has also been described. 
Alouf, J. E. "Streptococcal Toxins (Streptolysin O, Streptolysin S, Erythrogenic Toxin)." Pharmac. Ther. II: 661-717 
(1980), which is incorporated herein by reference. Accordingly, because the reported "specific activity" of wild type 
SLO is elusive, the foregoing percentages accomodate this fact. 

35 For convenience, as used herein, the term "vector" means a circular DNA macromolecule comprising at least one 

restriction site and at least one promoter gene. The term "plasmid" means a vector further comprising a portion of a 
genome of interest, including, interalia, a gene. The term "host" means a cell capable of being transfected by a plasmid. 

As those in the art appreciate, most vectors are selected with respect to a desired outcome. For example, in a 
commercial setting, high-level expression of the gene of interest will typically be preferred such that vectors with an 

40 appropriate promoter conducive to such expression will be chosen; on the other hand, in a research setting, such high 
level expression may not be critical such that a vector having translational and transcriptional signals that are under 
the control of regulatory elements of the host may be appropriate. Accordingly, in selecting a vector appropriate for the 
desired outcome, it is often useful to concurrently focus on the promoter gene of the vector of interest. 

Promoter genes which achieve very high levels of mRNA production include, for example, p L , P tac , and p T7 . This 

45 list is not intended, nor is it to be construed, as an exhaustive list. Rather these promoters are used as exemplars for 
purposes of the discussion to follow. Those in the art can readily select an appropriate vector having a desired promoter 
which can provide equivalent results vis-a-vis the listed promoters. 

For example, p T7 is used in conjunction with T7 RNA polymerase which synthesizes RNA at a rate several times 
that of E. coli RNA polymerase and which terminates transcription less frequently than E. coli RNA polymerase. T7 

so RNA polymerase is highly selective for initiation at its own promoter sequence; accordingly, it does not initiate tran- 
scription from any sequences on E. coli DNA. Furthermore, T7 RNA polymerase is resistant to antibiotics such as 
rifampicin that inhibit E. coli RNA polymerase. Therefore, the addition of rifampicin, for example, to cells that are pro- 
moting T7 RNA polymerase results in the exclusive expression of genes under the control of a T7 RNA polymerase 
promoter, i.e., p T7 . 

55 Expression using the T7 RNA polymerase/p T7 system relies upon (typically) a two-plasmid system: the first plasmid 

comprises the gene to be expressed and p T7 ; the second plasmid comprises the gene for T7 RNA polymerase. The 
second plasmid, e.g. pGP1-2 (which comprises the gene for T7 RNA poylmerase; see Tabor and Richardson, Proc. 
Natl.Acad.Sci. U.S.A. 82 : 1074-1078(1985)), can either permanently reside in E. coli or can be introduced into E. coli 
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with a specialized phage, such as, e.g., an M13 vector (such as, e.g., mGP1-2, see Tabor and Richardson), or a A, 
vector (such as, e.g., CE6, see Studier and Moffett, J.Mol.Biol. , 189 : 1 1 3-1 30(1 986)) comprising the T7 RNA polymer- 
ase gene. 

Typically, the second plasmid comprising the T7 RNA polymerase gene is under the control of a heat inducible E. 
s coli promoter, i.e., by raising the temperature from, e.g., 30°C to 42°C, the heat inducible E. coli promoter is switched 
on, which will in turn switch on the p T7 promoter of the first plasmid, thereby leading to the expression of, e.g., the gene 
of interest. Thus, when using a T7 RNA polymerase/p T7 expression system, the E. coli system comprises a heat- 
inducible promoter, such as, for example lambda P L with a Cl 857 repressor. 

Examples of vectors comprising p T7 include, e.g., the pT7 series (pT7-5, pT7-6, and pT7-7, which are derivatives 
10 of pT7-1 ; see Tabor and Richardson, supra. ) and the pET series (see Studier et al., Methods Enzvmol. 185 :60 - 89 
(1990)). 

Another vector system comprises a p L promoter gene. The p L promoter is derived from the X bacteriophage and 
is one of the most powerful regulated E. coli promoter. Transcription from p L can be fully repressed and therefore 
plasmids comprising p L can be stabilized by the X repressor, cl. This repressor is typically supplied by an E. coli host 

15 which comprises an integrated copy of a portion of the X genome. Such an E. coli host, referred to as an "E. coli lysogen" 
is characterized as follows: (i) it supplies the X regulatory proteins cl and N (an anti-termination function); and (ii) it 
does not provide lytic components that would normally lead to cell lysis. Accordingly, E. coli lysogens transfected with 
plasmids comprising, e.g., a gene of interest and p L , can be grown initially to high density without expression of the 
gene and subsequently induced to synthesize the protein under inactivation of the repressor. Examples of p L based 

20 vectors are described in, e.g., U.S. Patent No. 4,925,799 ("pAS1 "), Shatzman and Rosenberg, "The pAS Vector System 
and Its Application to Heterologous Gene Expression in Eschericia coli ." Heptalogy 7:305-355(1 987), and Rosenberg 
et al., "The Use of pKC30 and its Derivatives for Controlled Expression of Genes." Methods Enzymol 101 : 123-139 
(1983). 

The p tac promoter is a hybrid promoter based on the tac and Jac promoters, de Boer, et al. "The tac promoter: A 
25 functional hybrid derived from the trp and Jac promoters. " Proc.Natl.Acad.Sci.USA 80:21 -25(1 983); see also , Amann, 
et al. "Vectors bearing a hybrid tno -jac promoter useful for regulated expression of cloned genes in Eschericia coli ." 
Gene 25:167-178(1983). Because p tac includes the jac operator region, it can be repressed by E. coli strains that 
overproduce the Jac repressor, and be fully induced by addition of isopropyl p-D-thiogaloctoside (IPTG) thereto. 
All of the foregoing references are incorporated herein by reference. 
30 The choice of an appropriate vector/host system is within the realm of the particular needs of the artisan. A most 

preferred vector is based upon the p L promoter. Table I sets forth a representative (not exclusive) list of suitable vectors 
and hosts, as well as the sources thereof. 



TABLE I 



35 


Vector 


Host* 


Source 




pBTad DNA 


JM101, JM105, 
JM107, JM109 


(1) 


40 


pBTac2 DNA 


JM101, JM105, 
JM107, JM109 


(1) 




pNH8A 


D1210PH, D1210 


(2) 


45 


pNH16A 


D1210PH, D1210 


(2) 




pNH18A 


D1210PH, D1210 


(2) 


50 


pPROK-1 


JM109 


(3) 




pEX2 


N4830-1 


(3) 


55 


* = E. COLI CELL 

(1) = BOEHRINGER MANNHEIM 

(2) - STRATAGENE CLONING SYSTEMS 





(3) - CLONETECH LABORATORIES, INC. 
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TABLE I (continued) 



Vector 


Host* 


Source 


pUC19 


JM101, JM105, 
JM107, JM109 


(4) 


p33 


AR120, AR58 


(5) 


p 33 


AR120, AR58 


(5) 


p P L -Lamda 


N99cl+-N4830-1 


(6) 



* = E. COLICELL 

(4) = BETHESDA RESEARCH LABS 

(5) = SMITHKLINE BECKMAN NOW SMITH KLINE BEECHAM 

(6) = PHARMACIA LKB 



For the following examples, the vectors pA33 and pBTac2 DNA were utilized in conjunction with the host strains 
AR120 and JM105, respectively, for the subcloning (initially from pUC19 vector) and expression of rSLO.3. 

EXAMPLES 

The following Examples directed to preferred embodiments are not intended, nor are they to be construed to be, 
limitations on the disclosure of the claims to follow. 

Example 1 

Preparation of Partially Digested Genomic Streptolysin O DNA 

Genomic DNA was isolated from Streptococcus pyogenes (ATCC #1 0389) using the technique described in Kehoe, 
M. et al. Infect. Immun. , 55:3228-3232 (1987) (hereinafter "Kehoe, 1987"), which is incorporated herein by reference. 
Approximately Img of S. pyogenes DNA was obtained using this procedure (925u.g). 

To 370uJ of S. pyogenes DNA (2.5u.g/uJ) was added 300jllI of 10X High Salt Buffer (1.0M NaCI; 100mM tris-hy- 
droxyamino methane-chloride ("TRIS-CI"), pH7.5; 100mM MgCI 2 ; and 10mM dithriothreotol ("DTT")), 2310jal of deion- 
ized H 2 0 and 20uJ of Bgl II (BRL, Gaithersburg, MD, Cat. # 521 3SA), for a final volume of 3000uJ. This mixture was 
maintained at 37°C and incubated overnight. 

To this incubated mixture was added 3000uJ of Reagent A (250uJ phenol, 250uJ chloroform, 1 OjllI isoamyl alcohol, 
1 jlxI p-mercapthoethanol). This mixture was agitated prior to centrifugation in order to separate the aqueous and the 
organic layer. The aqueous supernantant was then precipitated with .3M NaOAc and 95% ethanol. The precipitate was 
then redissolved in 250jllI TE (10mM TRIS-CI, pH 7.5; 1mM EDTA) and 25jllI of 10X loading dye (0.2M EDTA; 50% 
glycerol; 0.25% xylene cyanol; 0.25% bromophenol blue) was added thereto, followed by electrophoresis on 1% aga- 
rose gel. The Bgl II partially digested S. pyogenes genomic DNA fragments were then evaluated according to size. 

As noted, SLO has an approximate molecular weight of 65,000 to 70,000 daltons. Each amino acid has an ap- 
proximate molecular weight of 110 daltons, such that (conservatively estimating) a 70,000 dalton protein would be 
encoded by approximately 636 codons, or 1909 base pairs. Accordingly, the partially digested fragments of between 
about 2,000 to 2,500 base pairs (i.e. , 2.0 to 2.5 Kb), as determined by the aforementioned gel electrophoresis method, 
were purified. The purified fragments were then resuspended in 150uJ of TE. For convenience, these are designated 
herein as "SLO inserts". 

Example 2 

Preparation of Streptolysin O Containing Plasmids 

The vector utilized was pUC19 (BRL, Cat. # 5364SA) cut with Bam HI (BRL, Cat. # 5201 SA). 

To 1jllI of cut pUC1 9 vector was added 15uJ of the SLO inserts, 3jlxI of 10X ligation buffer (660mM TRIS-CI, pH 7.5; 
50mM magnesium chloride; 10mM DTT; 10mM ATP). A final volume of 30uJ was achieved by the addition of SjlxI of 
deionized H 2 0. To this mixture was added 2\i\ of T4 ligase (USB, SjLxg/jLLl); incubation thereof at room temperature 
proceeded overnight. For convenience, the resulting material is designated as "SLO plasmid candidates". 
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Example 3 

Screening of SLO Plasmid Candidates 

s Host cells E. coli strain JM105 were transformed with the SLO plasmid candidates as follows. A vial containing 

300uJ of frozen JM105 competent cell was thawed, and 1 S. OjllI of the SLO plasmid candidates was added thereto. This 
admixture was incubated on ice for 30min, followed by heat shock in a 37°C water bath for 2min. Thereafter, the 
transfected JM105 solution was added to 2ml of LB medium (10g Bacto-tryptane; 5g Bacto yeast extract; 10g NaCI; 
1 liter deionized water; pH 7.5 with sodium hydroxide), followed by shaking (200RPM) for 30min at 37°C. Plating was 
10 thereafter accomplished on LB Ampicillin plates, followed by incubation overnight at 37°C; for convenience, these are 
designated "SLO transformants". 

Screening was accomplished utilizing a unique procedure. Following overnight growth, the colonies were overlaid 
with 3ml of 2.5% washed rabbit red blood cells in 0.8% agarose in PBS/IOmM DTT, which was spread to cover the 
plates. After 40min of incubation at 37°C, colonies comprising SLO were surrounded by small zones of hemolysis. In 
15 order to confirm that these colonies comprised SLO, a 25-mer oligonucleotide probe derived from nucleotides 670 
through 694, inclusive, of the reported DNA sequence of SLO (see Kehoe, 1 987) was used as a probe. The probe was 
prepared with a BioSearch 8600 DNA synthesizer, and labelled with 32 P following the T4 polynucleotide kinase pro- 
cedure described in Maniatis et al., Molecular Cloning , CSPL (1982), pp. 122-126 (hereinafter "Molecular Cloning "). 
The blood overlay screening technique proved to be an efficient and accurate method for rapidly screening the 
20 SLO expressed by the SLO transformants. Because a property of SLO is the ability thereof to lyse red blood cells, red 
blood cells from any source can be utilized, i.e., human, mouse, goat, rabbit, etc. Rabbit red blood cells are preferred 
due to the availability thereof. 

An SLO clone that led to the expression of protein which evidenced hemolytic activity and which hybridized with 
the 25-mer probe was designated "pUC1 9-SLO-B". For convenience, the non-vector DNA sequence thereof is desig- 
ns nated herein as "rSLO-candidates". 

Example 4 

Optimization of Expression and Determination of Solubility 

30 

In order to optimize the expression of rSLO-candidates, timed-digestion of rSLO-candidates using Bal-31 was 
accomplished. Additionally, and as previously noted, solubility of the expressed protein ab initio , i.e., without further 
chemical modification once expressed, is of import. This is because non-soluble SLO is by definition inactive. Accord- 
ingly, an analysis was also made to determine if the expressed protein was soluble, i.e. was located in a supernatant 

35 as opposed to a pellet, following centrifugation. 

The pUC1 9-SLO-B was initially cut with BstE II (New England Bio Labs, Cat. #162, 10 U/uJ) as follows. To 20uJ 
of pUC1 9-SLO-B (2.5u,g/u,l) was added 40uJ of 10X High Salt Buffer, 335uJ deionized H 2 0, and 5uJ of BstE II. This 
admixture was incubated at 60°C for 2hrs, followed by extraction with 400uJ of Reagent A, and precipitation with 44jal 
of 3M NaOAc (pH 4.8) in 888uJ of 95% ethanol. The precipitate was then redissolved in 40liI H 2 0. Thereafter, 90jllI of 

40 H 2 0, 20uJ of 1 0X Bal-31 Buffer (1 20mM CaCI 2 ; 1 20mM MgCI 2 , 2.0M NaCI; 0.2M TRIS-C1 , pH 8.0; 1 0mM EDTA), and 
50uJ of Img/ml Bovine Serum Albumin, was admixed with the redissolved precipitant. This was followed by the addition 
of 1 OjlxI of Bal-31 (New England Bio Labs, Cat. # 213, 100 U/ml), for a total of 210liI, followed by incubation at room 
temperature. To control the effects of Bal-31, 30jllI aliquots of the 21 OjlxI total solution was removed at 30, 45, 60, 80, 
105, 130 and 160min post-Bal-31 addition, and these aliquots were each admixed in 3.3uJ of 0.2M EGTA, followed by 

45 storage on ice. After preparation and storage of the last aliquot, all seven aliquots were pooled, extracted with 230uJ 
of Reagent A, and precipitated with 23uJ of 3M NaOAc in 506uJ of 95% ethanol. The precipitate was then redissolved 
in 75uJ of H 2 0. 

A fill-in reaction followed by the addition of 5uJ of 2.5mM dXTP, 1 OjllI of 10X Medium Salt Buffer (500mM NaCI; 
100mM TRIS-CI, pH 7.5; 100mM MgCI 2 ; 10mM DTT), and 1 OjllI of 100mM of DTTto75uJ of the redissolved precipitate, 
50 followed by the addition thereto of 6uJ of Klenow polymerase (5 U/uJ), and incubation at room temperature for 4hrs. 
This admixture was extracted with 1 OOjlxI of Reagent A, precipitation with 11jllI of 3M NaOAc in 22uJ of 100% ethanol, 
and resuspension of the precipitate in 40|llI of H 2 0. 

Following the fill-in reaction, 17uJ of the resuspended precipitate was admixed with 3lxI of a linker comprising a 
Bam HI sequence (New England Bio Labs, Cat #1021) and 5|llI of 5X linker ligation buffer (250mM TRIS-CI, pH 7.6; 
55 50mM MgCI 2 ; 5mM DTT; 5mM ATP; 2.5% (w/v) PEG 8000 (J.T.Baker, Cat. # U222-09)). This was followed by the 
addition thereto of 2uJ of T4 ligase (5 U/jlxI), and incubation thereof for 6hrs at room temperature. For convenience, the 
resulting material is referred to as "ca/ew". 

E. coli strain JM105 was transformed with ca/ew as described above, followed by overnight growth as described 
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above in Example 3. To determine if the plasmids comprised the Bam HI linker, to 40jlxI of ca/ew (0.5jLxg/jLxl) was added 
40jllI of 10X Medium Salt Buffer, and 320uJ deionized H 2 0. To this mixture was added 5uJ EcoRI (BRL, Cat # 5202 SA, 
10U/uJ), followed by incubation at 37°C for 2hrs. In order to ensure that the plasmid was cut, gel electrophoresis (1% 
agarose gel) was conducted; this resulted in a smear of different sizes, indicating a successful cut. To the cut plasmid 

s was added SjlxI of 5M NaCI, followed by 5uJ of Bam HI (10 U/jllI). This mixture was incubated for 37°C for 2hrs. Deter- 
mination of the size of the rSLO-candidate sequence subjected to Bal-31 digestion was conducted by gel electrophore- 
sis (1 % agarose gel). This resulted in a band of interest at about 1 .2 to about 2.0 Kb which comprised rSLO-candidates. 
Thus, the initial fragments of 2.0 to 2.5 Kb which evidenced hemolytic activity had been significantly decreased in size. 
The band which comprised rSLO-candidate was cut from the gel and purified in 1 5uJ of TE such that rSLO-candidate 

10 was available for ligation in pUC1 9 vector previously cut with Bam HI and ECoRI. In order to accomplish such ligation, 
1 0jal of the gel-purified rSLO - candidate was admixed with 4uJ of the previously prepared vector, 2uJ of 10X ligation 
buffer, 2uJ of "lOmM ATP, and 2uJ of deionized H 2 0. To this admixture was then added 2uJ of T4 ligase, followed by 
incubation at room temperature for 6hrs. E. coli host cell strain JM105 was transformed with these plasmids as above, 
and active colonies were screened by the red-blood cell overlay method disclosed above. Active colonies were then 

15 selected, innoculated in LB Medium/1 OOjag/m I Ampicillin and grown overnight under the conditions described above. 

Following overnight growth, the cells were centrifuged for 5min at 8000RPM at 4°C, and the resulting pellet resus- 
pended in 2ml of Reagent B (1 50mM NaCI; 20mM TRIS, pH 7.0; 1 mM EDTA). Thereafter, the resuspended cells were 
subjected to sonication for 2X 30sec. on ice, followed by centrif ugation at 9500RPM for 40min at 4°C using a Beckman 
JA20.1 centrifuge to obtain the expressed protein. 

20 At this stage, if the rSLO-candidate led to the expression of a soluble protein, that protein would be located in the 

supernatant. Accordingly, analysis was conducted for the presence of rSLO-candidate in the supernatant using stand- 
ard Western blot protocols for determination of an antigenically active protein. The results of such Western blot analysis 
indicated that there was an SLO fusion product in the supernatant which was recognized by horse anti-SLO antibodies. 
One such fusion product was selected and designated "rSLO.3" For convenience, the DNA sequence leading to the 

25 expression of rSLO.3 is also referred to as rSLO.3. High level expression of rSLO.3 was thereafter attempted. 

Example 5 

High Level Expression of rSLO.3 

30 

Removal of rSLO.3 from the plasmid comprising pUC1 9 vector was effectuated as follows. To 15uJ of the plasmid 
comprising rSLO.3 (0.5u,g/uJ) was added to 40llI of 10X Sma I Buffer (200mM KCI; 100mM TRIS-CI, pH 8.0; 100mM 
MgCL 2 ; 10mM DTT), and 345uJ of deionized H 2 0. To this mixture was added 5uJ Sma I (BRL, Cat # 5228 SA, 10U/ 
uJ), followed by incubation at 37°Cfor2hrs. In order to ensure that the plasmid was cut, gel electrophoresis (1% agarose 

35 gel) was conducted; this resulted in a single band, indicating a successful cut. To the cut plasmid was added SjlxI of 5M 
NaCI, followed by 5jllI of Bam HI (1 0U/jllI). This mixture was incubated for 37°C for 2hrs. To ensure that the rSLO.3 
sequence was successfully cut from the approximately 2.7Kb pUC 19 vector, gel electrophoresis (1% agarose gel) 
was conducted. This resulted in two bands, one at about 2.7Kb (the vector), and the other at about 1.4Kb (rSLO.3). 
This band was cut from the gel and purified in 1 SjlxI of deionized H 2 0 such that rSLO.3 was available for ligation in 

40 pA33 vector previously cut with Bam HI and Sma I. 

To 2llI of above-derived rSLO.3 DNA was added 2uJ of the above described vector, 1.5uJ of 10X ligation buffer, 
1 SjlxI of 10mM ATP, and 8liI of deionized H 2 0. To this admixture was added 2jllI of T4 ligase (10 U/jllI), followed by 
incubation for 5hrs at room temperature. Such incubation resulted in plasmids comprising rSLO.3 and pA33 vector. 
E. coli strain AR1 20 was transformed with the above-described plasmids in accordance with the procedure outlined 

45 for E. coli strain JM105. Thereafter, a DNA mini prep described in Current Protocols in Molecular Biology, Auschel, F. 
M. et al., Eds, John Wiley & Sons (New York) (1987), Section 1.6., was effectuated followed by cutting the plasmids 
with Bam HI and Sal I (BRL, Cat. # 5217 SA) to determine if the plasmids comprised rSLO.3. Those host cells trans- 
formed with plasmids comprising rSLO.3 were then subjected to induction via the nalidixic acid protocol. See Mott, J. 
E. et al "Maximizing gene expression from plasmid vectors containing the >*p L promoter: Strategies for overproducing 

50 transcription termination factor p." PNAS USA , 82 : 88-92(1985), which is incorporated herein by reference. As those 
in the art appreciate, nalidixic acid, which damages DNA, induces recA protein, a recovery protein for E. coli. A derivative 
benefit vis-a-vis overexpression is that recA has protease activity which, inter alia , leads to inactivation of Xc\ + repres- 
sor; this inactivation leads to overexpression by the p L promoter. 

Specifically, colonies comprising the transformed AR120 were lifted from the agar plates and innoculated in Su- 

55 perbroth(Base - 1 2g tryptone, 24g yeast extract, 5ml glycerol, 900ml distilled H 2 0; Salt (per liter of base) - 1 .7g KH 2 P0 4 , 
1 5.8g K 2 HP0 4 (anhydrous) 1 00ml distilled H 2 0) plus 1 0OLig/ml ampicillin at 37°C until the optical density of the medium 
at 

^650 ©quailed 0.4. Thereafter, nalidixic acid was added to the innoculated mixture at a final concentration of 60Lig/ 
ml and incubated at 37°C for 4hrs. Western blot analysis of the supernatant demonstrated the presence of rSLO.3 
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The DNA and amino acid sequences of rSLO.3 was thereafter determined (Lark Sequencing Technologies, Hou- 
ston TX). A single stranded representation of the determined DNA sequence of rSLO.3 is presented in Figure 1 and 
the determined amino acid sequence of of rSLO.3 is presented in Figure 2. 

s Example 6 

Specific Activity of rSLO.3 

Protein concentration and specific activity of non-purified SLO.3 was determined immediately following nalidixic 
10 acid induction. 

Protein concentration for rSLO.3 crude extracts was derived using the BioRad Protein Assay method (Coomassie 
Blue G-250). Nalidixic acid induced protein mixtures were centrifuged at 8000RPM for 5min at 4°C and the pellets 
resuspended in 500uJ sonication buffer (40mM TRIS, pH 7.5; 1 mm EDTA; 1 mM DTT; 200 mM NaCI). The resuspended 
pellets were then sonicated for 2X 30sec on ice, followed by centrif ugation at 1 2,000RPM for 40min at 4°C. Thereafter, 

15 5jlxI of the resuspended rSLO.3 mixture was analyzed for protein concentration (OD reading at A 595 ), and the protein 
concentration was determined to be 4.6|ug/uJ. 

Specific activity was determined by serial dilutions of the above described crude extract and addition thereto of 
washed rabbit red blood cells ("RRBC"), followed by spectrophotometric reading (OD reading at A^-,). 5ml of fresh 
rabbit blood was washed 2X with 45ml of PBS including "lOrnM DTT, followed by centrifugation at 2000RPM for 5min 

20 at 4°C. Thereafter, 1 . 1 25ml of the washed rabbit red blood cells ("RRBC") were drawn from the bottom of the tube and 
48.875 ml of PBS/1 OmM DTT was added thereto. This resulted in a solution comprising 2.25% RRBC. For the hemolytic 
assays, 500uJ of the 2.25% RRBC was added to 500uJ of 1 :2 serially diluted rSLO.3 in PBS/1 OmM DTT, followed by 
incubation at 37° for 30min. 

These serial dilutions were spectrophotometrically analyzed (OD readings at A 541 ). This analysis indicated that . 
25 2uJ of the diluted rSLO.3 crude extract caused 50% hemolysis of the RRBC; .2uJ of the diluted extract is equivalent to 
2uJ of the extract itself. Accordingly, the rSLO.3 crude extract evidenced one hemolytic unit ("HU") per two microliters, 
or 500 HU/ml. 

As noted, the protein concentration of the crude extract was determined to be 4.6mg/ml. Accordingly, the specific 
activity of rSLO.3 derived from the pA33-AR120 expression system was 108.7HU/mg. It is noted that because these 
30 are values for a crude (i.e. non-purified) extract, these values are predicated upon total protein concentration of the 
extract. For a purified extract, the specific activity values increase. 

Example 7 

35 Recovery of rSLO.3 

The following procedure is for approximately 200 grams of transformed host cells (i.e., approximately 6 grams total 
protein). 

Transformed host cells were resuspended in 200mls of Reagent C (40mM TRIS, pH 7.5; 1 mM EDTA; 0. 1 % 2-mer- 
40 captoethanol), followed by the addition of 100mM PMSF Thereafter, the cells were disrupted by sonication, followed 
by the addition of 4ml of 100mM PMSF. This admixture was centrifuged for 30min at 4°C at 15,000RPM. 

The resulting supernatant was removed and saved; 200ml of Reagent C was added to the pellet, followed by the 
addition of 4mls of 100mM PMSF. The resuspended pellet was then sonicated, followed by centrifugation as above. 
The resulting supernatant was then removed and pooled with the previous supernatant, and the pH thereof was adjusted 
45 to 7.0 with NaOH. 

To the final volume of supernatant was slowly added (with stirring at room temperature) Polymin P (Aldrich Chem- 
icals) to a final concentration of 0.75%. This admixture was then centrifuged for 30min at room temperature at 10,000 
RPM, followed by retrieval of the supernatant. Solid sodium sulfate was slowly added with stirring to 80% saturation 
of the supernatant. 

50 Thereafter, the admixture was stirred for 2hrs at 4°C, followed by centrigation for 30min at 4°C at 15,000 RPM. 

The pellet was then retrieved and resuspended in 400mls of saturated ammonium sulfate, pH 7.0. The admixture was 
then centrifuged for 30min at 4°C at 1 0,000 RPM, followed by retrieval of the pellet and resuspension thereof in 200mls 
in Reagent D (20mM TRIS, pH 7; 1mM EDTA; 0.1% 2-mercaptoethanol). 

The resuspended pellet was then dialyzed against 2 liters of Reagent D, with 4 changes, at 4°C. Sufficient room 
55 was left in the dialysis bag in that the volume of the sample increases. Following dialysis, the pH of the sample was 
checked, and adjusted to 7.0 with NaOH. 

The sample was then loaded onto a Pharmacia Fast Flow S-Sepharose column equilibrated in Reagent D. A 400ml 
bed volume was found to be sufficient to remove the mSLO.3/6 from the sample. The flow through, comprising E. coli 
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proteins, was collected and discarded, and the column was washed with approximately 1 liter of Reagent D. 

The rSLO.3 was eluted with 2 x 1 liter 0.0 to 0.4M NaCI gradient in Buffer B. The fractions were analyzed by SDS 
acrylamide gel (9%), and fractions with high amounts of rSLO.3 were pooled. Approximately 250ml of pooled rSLO.3 
was recovered. 

s Using the above procedure, approximately 60% of the original total protein (i.e. approximately 0.36 grams) was 

rSLO.3, which can be stored at 4°C until needed. 

Example 8 

10 Purification of rSLO.3 

Purification of rSLO.3 was accomplished to a purity of at least 80% using the following protocol. 

Approximately 600g of frozen cell paste derived in accordance with the protocol described in Example 9 was 
thawed (37°C), resuspended in 3 liters of cold lysis buffer (40mM TRIS-CI, pH 7.0; Imm EDTA; .1% 2-mercaptoethanol; 
15 2M Nad; 4°C) and sonicated for 60min at 4°-10°C with a Heat Systems Ultrasonics Continuous Flow sonicator 
(Farmingdale, N.Y, No. W-385). Thereafter, the material was centrifuged on a Beckman JA10 centrifuge at 9500RPM 
for 40min at 20° to 26°C. Approximately 3 liters of supernatant was retrieved. 

To the supernatant was added a 1 2.5% stock solution of Polymin P precipitant (Aldrich, Milwaukee, Wis.) to a final 
concentration of between 0.2 to 0.3%. The solution was then stirred for Ihr at room temperature and the precipitate 
20 discarded. The pH of the liquid portion was then adjusted to 7.0 with NaOH. This liquid was then permitted to stand 
overnight at room temperature. 

Thereafter, the solution was centrifuged as above, and a clear supernatant retrieved. The supernatant was then 
loaded onto a 1 liter phenyl-sepharose HIC column (Pharmacia, Piscataway, N.J.) at 2ml/min. at room temperature. 
Thereafter, the column was washed with an elution buffer (20mM TRIS-CI, pH 7.0; ImM EDTA; 0. 1 % BME) at 7ml/min. 
25 Fractions were monitored by SDS-PAGE electrophoresis using the Pharmacia Phast-Page™ System. Protein concen- 
tration was determined with the BioRad Protein Assay Kit. Fractions containing protein were then pooled. 

The pooled fractions was then loaded onto a 1 liter Blue Affinity Column (BioRad, Richmond, California) at 2ml/ 
mm at room temperature, followed by washing using the elution buffer described above at 2ml/min. at room temperature 
for two column volumes. 

30 Elution of bound protein was accomplished using an NaCL density gradient of 0.0 to 0.8M, pH 7.0. Fractions were 

monitored with the Phast-PAGE System and protein concentration determined with the BioRad Protein Assay Kit. A 

single peak was obtained at 0.3-0.4M on the NaCL density gradient. 

Purity of the eluted rSLO.3 was evaluated using a Beckman DU 7500 spectrophotometer, based upon analysis of 

major band homogeneity obtained from gel electrophoresis (12% SDS-polyacrylamide) of six different amounts of the 
35 eluted rSLO.3 (16, 8, 4, 2, 1, .5|ug rSLO.3). The evaluated purity of rSLO.3 based upon major band homogeneity is 

set forth in Table 2: 



Table 2 



rSLO.3 (ug) 


Percent Homogeneity rSLO.3 


0.5 


99.0% 


1.0 


99.0% 


2.0 


94.4% 


4.0 


82.4% 


8.0 


81.4% 


16.0 


80.1% 



For determination of hemolytic activity, the concentration of purified rSLO.3 was determined. A 1:25,600 titer of 
a .7mg/ml concentration of purified rSLO.3 was required to obtain greater than 50% lysis of 2.5% RRBC. Accordingly, 
50 the specific hemolytic activity of purified rSLO.3 is about 3.6 x 1 0 4 HU/mg (25,600 - .7). 

As noted, the specific activity and percent hemolytic activity of specific versions of rSLO (purified rSLO.3), based 
upon the "specific activity" of wild-type SLO, is as follows: 

55 
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TABLE 3 





Wild-Type SLO 


rSLO.3 


Specific Activity (Hemolytic Activity in Hemolytic Units/mg) 
Percent Hemolytic Activity of Wild-Type SLO 


a) 1x10 6 

b) 4x1 0 5 

a) 100 

b) 100 


3.6x1 0 4 
3.6 

9 



Example 9 

]n Vivo Toxicity Effects of rSLO.3 

In order to evaluate in vivo toxicity effects of rSLO.3, Balb/c mice were administered undiluted and diluted intra- 
veneous injections of rSLO.3. Undiluted and diluted control suspension buffer was administered to an equivalent 
number of mice. To improve the intraveneous injections, the mice were warmed under a heat lamp for 20-30 minutes 
of pre-injection. Approximately 20 mice were used for each condition. 

For the undiluted rSLO.3, each mouse received an approximate dosage of 17 mg/kg, while for the diluted rSLO. 
3, each mouse received an approximate dosage of 1000 jag/kg. Control solution buffer did not affect the control mice. 

Aside from minor ruffling for several minutes after injection, none of the mice receiving either diluted or undiluted 
rSLO.3 showed any ill effects from the intraveneous administrations. Thus, while rSLO.3 is hemolytically active, mice 
receiving injections of rSLO.3 as described above did not expire. 

Example 9 

Subcloning of rSLO.3 

Having obtained, verified and sequenced rSLO.3, subcloning and expression thereof using another expression/ 
vector system was initiated. The vector, pBTac 2 DNA (Boehringer Mannheim, Cat. No. 1081381, lOjixg) was cut with 
Hind III (BRL, Cat. No. 52075A, 10 U/ml) by admixing 30uJ of pBTac2 DNA (1 jLxg/jLLl), 30uJ of 10X Medium Salt Buffer, 
240jlxI deionized H 2 0, followed by addition thereto of 5uJ of Hind III (BRL, Cat. # 5207 SA, 10U/jlxI). This admixture was 
incubated for 2hrs at 37°C. Thereafter, the admixture was analyzed by agarose electrophoresis (1% agarose gel) to 
determine if the vector had been successfully cut; a single band indicated that the cut had been successful. 

To the 305uJ admixture was added 300uJ of Reagent A. This admixture was then centrifuged for 5min at 1 2,000RPM 
on a Beckman microcentrifuge, followed by retrieval of the upper liquid layer. To this liquid layer was added 33jllI of 3M 
NaOAc (pH 4.8) and 660uJ of ethanol, followed by precipitation overnight at -20°C. This was followed by centrifugation 
for 1 0min at 1 2,000RPM on a Beckman microcentrifuge. The pellet was retrieved and dried by air. The dried pellet was 
then resuspended in 150uJ of deionized water. 

In order to blunt (fill-in) one end of the Hind III cut vector, the 150uJ solution comprising the resuspended pellet 
was admixed with 1 OjlxI of 20X dNTP (2.5mM), 20uJ of 10X MSB and 20uJ of 100mM DTT This was followed by the 
addition of 4uJ of Klenow polymerase (New England Biolabs, Cat. No. 21 0, 5 U/ml) and incubation at room temperature 
for 7hrs. Thereafter, 300jlxI of Reagent A was added to the incubated mixture, followed by centrifugation for 5min at 
12,000RPM. The upper liquid layer was retrieved and precipitated as above. The dried pellet was then resuspended 
in 30uJ of deionized H 2 0. For convenience, the filled-in, Hind III cut vector is referred to as "vec.rb". 

Thereafter, vec.rb was cut with Bam HI (BRL, Cat No. 5201 SA, 10 U/jlxI). To 30jlxI of vec.rb was added 30jlxI of 10X 
High Salt Buffer and 240uJ of deionized H 2 0. To this admixture was added 5jllI of Bam HI, followed by incubation for 
2hrs at 37°C. To the incubated mixture was added 300jllI of Reagent A, followed by centrifugation as above. The upper 
liquid layer was retrieved and precipitated as above. The dried pellet was then resuspended in 20uJ of deionized H 2 0. 
The resuspended pellet comprised Hind III cut, filled-in, Bam HI cut pBTac2 DNA. 

The rSLO.3 was removed from the plasmid described above as follows. To 40uJ of the plasmid comprising rSLO. 
3 (1 jLxg/1 jllI) was added 10X Smal Buffer, and 320uJ deionized H 2 0. To this mixture was added 5uJ Sma I (10U/uJ), 
followed by incubation at 37°C for 2hrs. In order to ensure that the plasmid was cut, gel electrophoresis (1% agarose 
gel) was conducted; this resulted in a single band, indicating a successful cut. To the cut plasmid was added 8jlxI of 5M 
NaCI, followed by 5uJ of Bam HI (10 U/jlxI). This mixture was incubated for 37°C for 2hrs. To ensure that the rSLO.3 
sequence was successfully cut from the approximately 6.3Kb pA33 vector, gel electrophoresis (1% agarose gel) was 
conducted. This resulted in two bands, one at about 6.3Kb (the vector), and the other at about 1.4kB (rSLO.3). The 
1 .4Kb band was cut from the gel and purified in 20uJ of deionized H 2 0 such that mSLO.3/6 was available for ligation 
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in the prepared pBTac2 vector. 

To 3jlxI of the vector was added 2jllI of mSLO.3/6, 1 .SjlxI of 10X Ligation Buffer (0.66M TRIS-CI (pH 7.5), 50mM 
MgCI 2 , 50mM DTT, 10mM ATP), 1.5uJ of 10mM ATP and 7uJ of deionized H 2 0. Thereafter, 1.5uJ of T4 Ligase was 
added thereto, followed by incubation overnight at room temperature. For convenience, this mixture is referred to as 
5 the "subclone/. 

E. coli strain JM1 05 was transfected with subclone T as follows. A vial containing 300uJ of frozen JM1 05 competent 
cell was thawed, and 8.0jllI of subclone x was added thereto. This admixture was incubated on ice for 30min, followed 
by heat shock in a 37°C water bath for 2min. Thereafter, the transformed JM105 solution was added to 2ml of LB 
medium (10g Bacto-tryptane; 5g Bacto yeast extract; 10g NaCI; 11 deionized water; pH 7.5 with sodium hydroxide), 

10 followed by shaking (200RPM) for 30min at 37°C. Plating was thereafter accomplished on LB Ampicillin plates, followed 
by growth overnight at 37°C. Screening was accomplished using the blood overlay method disclosed above, and 
colonies evidencing hemolysis were selected. 

Selected screened colonies comprising rSLO.3 subclones were innoculted in 12ml of Superbrothampicillin broth. 
Induction was accomplished by the addition of isopropyl-p-D-thiogalactopyranoside ("IPTG"), at a final concentration 

15 of 1mM, to the culture broth when the culture broth had an OD 60 q reading of 0.7. 12ml of the resulting solution was 
centrifuged at 8000RPM for 10min at 4°C and the resulting pellet resuspended in 1.2ml of PBS/1 OmM DTT. The re- 
suspended pellet was sonicated for 1.5min; the protein concentration of the sonicated extract was determined using 
the BioRad Protein Assay protocol described above. The protein concentration was determined to be 9.3mg/ml. This 
data was used to determine the specific hemolytic activity of the sonicated extract by titer based upon the 50% lyses 

20 of 2.5% washed rabbit red blood cell protocol described above. The hemolytic activity based upon titer of the culture 
comprising rSLO.3 was determined to be 2.69 x 10 3 - 

The foregoing Examples are directed to the generation of an SLO genomic library. As those in the art appreciate, 
another type of library which is much less complex than a genomic DNA library is a "complementary DNA", or "cDNA", 
library. cDNA is derived directly from mRNA; therefore, by definition, the cDNA library is comprised of regions of trans- 

25 lation. Methods for deriving cDNA libraries based upon mRNA complementary to mSLO DNA are considered to be 
within the purview of the skilled artisan such that cDNA-based libraries for mSLO are a part of this disclosure. 

PROTEIN FOLDING 

30 As the linear arrangement of nucleotides defines a specific codon, the arrangement, or sequence, of amino acids 

defines the protein, including the particular function thereof. However, while the particular amino acid sequence is 
important with respect to the identity of the protein, the particular three dimensional shape that the protein exhibits is 
of similar import. Such specificity in terms of shape, in essence, co-defines the properties of the protein because the 
shape of the protein enables the protein to specifically interact with other molecules that will only recognize that par- 

35 ticular protein shape. 

Most proteins spontaneously fold into their correct shape. By treating the protein with certain denaturing solvents, 
the protein can "unfold" into a flexible chain. When the denaturing agent is removed, portions of the flexible chain may 
refold into their original conformation. This is because one of the most important factors governing the folding of a 
protein is the distribution of polar (hydrophillic, or "water-loving") and non-polar (hydrophobic, or "water-hating") side 

40 chains of the amino acids of that protein. Denaturing solvents interfere with the polarity of the amino acid side chains. 
The following amino acids have polar side chains: Asn; Gin; Ser; Thr; and Tyr. The following amino acids have non- 
polar side chains: Gly; Ala; Val; Leu; Iso; Pro; Phe; Met; Trp; and Cys. Amino acids with basic and acidic side chains 
are very polar. The following amino acids have basis side chains: Lys; Arg; and His. The following amino acids have 
acidic side chains: Asp and Glu. 

45 The environment in which proteins naturally exist is, by definition, a non-denaturing environment, which is most 

typically aqueous. Accordingly, the hydrophobic side chains of a protein tend to be pushed together in the interior of 
the protein molecule, which enables these to avoid contact with the aqueous environment. Polar side chains, on the 
other hand, tend to arrange themselves near the outside of the protein molecule, where they can interact with water 
and other polar molecules. 

50 While the molecular mechanisms by which a linear DNA sequence is transcribed and translated into a precise 

amino acid sequence of the corresponding polypeptide is well understood, exactly how the polypeptide chain folds 
simultaneously and autonomously into its three-dimensional structure is not clearly understood. However, the real 
potential of synthetic DNA, i.e. DNA synthesized via recombinant techniques, will be realized in the area of protein 
design. In order for this to be realized, however, the mechanism of protein folding will have to be more succinctly 

55 clarified. While the general problem of predicting protein structure from the sequence is elusive (principally because 
no rules have emerged that allow structure to be related to sequence), it is clear that certain portions of the sequence 
are important to the structure and other portions are relatively unimportant from a structural point of view such that 
substitutions or modifications can be made at these portions. Accordingly, it is assumed that portions of the sequence 
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of a protein contribute significantly to the stability of the folded protein structure. 

While predicting a protein structure from the protein sequence is elusive, proteins, by definition, have unique three- 
dimensional structures which can be determined. The following methodologies, for example, can be used in the de- 
termination of protein structure: Crystallography; Optical Activity, Nuclear Magnetic Resonance Spectroscopy. 

5 

a) Crystollagraphy 

Proteins are capable of forming crystals. Proteins usually crystallize in a condition of saturation or super-saturation 
which can be achieved by altering one or more of a number of variables that affect the solubility of the proteins. Thus, 

10 by altering the ionic strength of the solution or by utilization of organic polymers, e.g., polyethylene glycol, proteins can 
be crystallized. Techniques for growing protein crystals are set forth in Narang, S.A. Protein Engineering: Approaches 
to the Manipulation of Protein Folding (Butterworth, Publisher, Stoneham MA., 1990), Chpt. 6 (hereinafter "Narang"). 
The preceding text book is incorporated herein by reference in its entirety. Having crystallized the protein, the techniques 
of x-ray, neutron, and electron diffraction can be used to determine to structure of the protein, with x-ray diffraction 

is being preferred. The protein structure in the crystal is assumed to be at or near the minimum conformational free energy 
of the molecule for the crystal form. 

b) Optical Activity 

20 The optical activity of polypeptides/proteins due to the asymmetric centers of the amino acids and to the asymmetric 

conformations thereof, can be utilized to determine the structure of polypeptides/proteins. This asymmetry causes 
proteins to interact differently with right- and left-circularly polarized light; if the two beams consequently travel at 
different speeds through the protein, polarized light is rotated. Optical rotatory dispersion ("ORD") is the dependence 
of this rotation upon wavelength. In a wavelength region where the protein molecule does not absorb light, the rotation 

25 varies gradually with wavelength, but in an absorbance region, the rotation first increases sharply in one direction, falls 
to zero at the absorption maximum, and then rises sharply in the opposite direction. There will also be unequal ab- 
sorption of left- and right-circularly polarized light; this is referred to as circular dichroism ("CD"). Both CD and OES 
spectra of a protein are very sensitive to the structural conformation thereof. Folded proteins generally have significant 
optical activity in the near-UV region (250-300nm). 

30 

c) Nuclear Magnetic Resonance Spectroscopy 

Nuclear Magnetic Resonance Spectroscopy, using, e.g., 1 H, 13 C, 15 N, 13 P or 2 H, has proven to be of great use in 
studying protein structure in solution. Focusing on 1 H, each hydrogen atom in a molecule has a nuclear magnetic spin, 

35 i.e. the nuclei of the atom act like tiny magnets. In the absence of an external magnetic field, the magnetic moments 
of the protons are randomly oriented. In a Nuclear Magnetic Resonnance experiment, a strong external magnetic field 
is applied to the sample along a specified direction, resulting in a net alignment of the magnetic moments and a net 
macroscopic magnetization along the specified directional axis; a short radio-frequency pulse of appropriate strength 
is then applied, knocking the magnetization vector away from this axis. As the magnetization recovers, a transient 

40 radio-frequency signal is recorded as a function of time. A fourier-transform of this signal then yields a frequency 
spectrum. Each proton in the molecule gives rise to a peak in this spectrum occurring at some characteristic resonnance 
frequency determined by the local electronic environment of that proton. The resonance frequency of a particular proton 
is called its "chemical shift" and is measured as an offset from some reference frequency. Structured information from 
NMR is derived from the nuclear Overhauser effect ("NOE", which determines whether a pair of protons are near each 

45 other in space) and the coupling constants of protons that are separated by three or fewer chemical bonds. NOE and 
coupling constants provide one-dimensional data; two-dimensional data is provided by inter alia nuclear Overhauser 
enhancement spectroscopy (NOESY) and two-dimensional correlation spectroscopy (COSY); and from such data, 
three-dimensional protein structures can be determined. 

In view of the foregoing information set forth with respect to determination of the three-dimensional structure of 

50 protein molecules, the following claims directed to DNA macromolecules and amino acids inherently include the three 
dimensional structures associated with the protein molecules expressed thereby. 

The Examples herein are not to be construed as limited to specific vectors, plasmids and host cells which are 
preferred. The rSLO described herein is not to be construed as limited solely to the preferred rSLO designated rSLO. 
3, or to the preferred vectors, plasmids and host cells. Similarly, the preferred rSL0.3in no way constitute an admission, 

55 either actual or implied, that the DNA and amino acid sequences thereof are the only DNA and amino acid sequences 
to which Applicants are entitled. They are entitled to the full breadth of protection under applicable patent laws. 

For purposes of claiming materials by designation, AR120 transformed with plasmids comprising pA33 - rSLO.3 
and JM105 transformed with plasmids comprising pBTac2 DNA - rSLO.3 were deposited on August 23, 1991 with the 
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American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, Maryland, 20852, under the provisions 
of the Budapest Treaty for the International Recognition of the Deposit of Microorganisms for the Purpose of Patent 
Procedure. These were tested by the ATCC on August 27, 1991, and determined to both be viable. The ATCC has 
assigned the deposit numbers ATCC 68675 and ATCC 68677, respectively, to these materials. 

s Based upon the disclosure herein, those skilled in the art can readily obtain fragments of the DNA sequence set 

forth in Figure 1 such that the fragment continues to maintain at least one epitopic site characteristic of wild-type SLO 
and maintains hemolytic characteristics. Furthermore, and as noted, conservative substitutions of nucleotides can be 
made without concomitant changes in the amino acid sequence, as those in the art understand and appreciate. For 
example, "computerized back translation" techniques can be used, whereby the amino acid sequence is analyzed by 

10 a computer and the computer determines the optimum nucleotides to utilize in the codons necessary to encode such 
amino acids. Additionally, as the DNA synthesis art progresses such that oligonucleotides having the length of the DNA 
sequence of Figure 1 can be rapidly obtained, one can synthesize that sequence as appropriate with such advances 
in the art. 

Because the screening of SLO derivatives can be readily accomplished using the above-described blood overlay 
15 technique, numerous SLO derivative candidates can be rapidly evaluated. Accordingly, those skilled in the art can 
readily use this technique to derive SLO derivative analog candidates, rapidly screen these candidates for indications 
of hemolytic activity, and determine the nucleic acid and amino acid sequences of analogs. 

Accordingly, while the Examples herein are directed to a specific SLO variant, rSLO.3, and because having had 
this advance in the art placed in their possession, those in the art can utilize techniques known to the art to adapt this 
20 advance to their own ends, Applicant's invention is seen to comprehend SLO derivatives having the characteristics as 
defined, and is not limited to the specific derivative disclosed in the Examples. 

Although the present invention has been described in considerable detail with regard to certain preferred embod- 
iments thereof, other embodiments within the scope of the teachings of the present invention are possible. As such, 
while the production of a specific SLO derivative has been described in detail, this is to be construed as an exemplar. 
25 Accordingly, neither the disclosure, nor the claims to follow, are intended, nor should be construed to be, limited by the 
descriptions of the preferred embodiments contained herein. 

37 C.F.R. §1.821(c) SEQUENCE LISTING 

30 (1 ) GENERAL INFORMATION 

(i) APPLICANT: Adams, Craig M. and Wang, Eva Y. 

(ii) TITLE OF INVENTION: Streptolysin O Derivatives 

35 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

40 (A) ADDRESSEE: Beckman Instruments, Inc. 

(B) STREET: 2500 Harbor Blvd. 

(C) CITY: Fullerton 

(D) STATE: California 

(E) COUNTRY: USA 
45 (F) ZIP: 92634 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb 
so (B) COMPUTER: IBM 

(C) Operating System: MS. DOS 

(D) SOFTWARE: WordPerfect 5.1 

(vi) CURRENT APPLICATION DATA: 

55 

(A) APPLICATION NUMBER: (Herewith) 

(B) FILING DATE: (Herewith) 

(C) CLASSIFICATION: 
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(vii) PRIOR APPLICATION DATA: (Not Applicable) 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION 

(A) NAME: Burgoon, Richard P. 

(B) REGISTRATION NUMBER: 34,787 

(C) REFERENCE/DOCKET NUMBER: 128D-1023 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (714) 773-7610 

(B) TELEFAX: (714) 773-7936 

(2) INFORMATION FOR SEQ ID NO.: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1524 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: genomic 

(B) CLONE: rSLO.3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 1 
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ATG GAT CCG TCA 
5 GAA ATC AAT GAC 

CTT GCT AAA AAT 
w GTT AAG AAA GCT 

ATC AAC ACT ACA 
75 AGG ACC TAT CCA 

AAC AAA CCA GAC 
20 ATT GAT TTA CCA 

CCT ACC TAT GCC 
25 TGG CAT GAT AAT 

TAT ACT GAA TCA 
30 AAT GTT AAT AGC 

TCG ATT TCA AAA 
35 ATT TTT TAC ACC 

TTT GAT AAA TCA 
40 AAT GAA GCT CCG 

GTT TTT GTC AAA 

GCC TTT AGT GCA 

TAC TCT GAT ATC 

50 

GGA GAT GCT GCA 
ATT AGA AAC GTT 
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GAA GAC AAA AAA AAG AGC GAA 
AAG ATT TAT TCA CTA AAT TAT 
GGT GAA ACC ATT GAA AAT TTT 
GAT AAA TTT ATT GTC ATT GAA 
CCA GTC GAT ATT TCC ATC ATT 
GCA GCC CTT CAG CTG GCT AAT 
GCG GTA GTC ACC AAG CGA AAC 
GGT ATG GGA GAC AAA GCA ACG 
AAT GTT TCA ACA GCT ATT GAT 
TAT TCT GGT GGT AAT ACG CTT 
ATG GTA TAT TCT AAG TCA CAG 
AAA ATC TTA GAT GGT ACT TTA 
GGT GAA AAG AAG GTG ATG ATT 
GTA TCA GCA AAC CTT CCT AAT 
GTG ACC TTT AAA GAG TTG CAA 
CCA CTC TTT GTG AGT AAC GTA 
CTA GAA ACA AGT TCT AAA AGT 
GCT CTA AAA GGA ACA GAT GTT 
TTA GAA AAT AGC TCA TTT ACA 
GAG CAC AAT AAG GTA GTC ACA 
ATC AAA GAC AAT GCT ACC TTC 



GAA GAT CAC ACT GAA 4 8 

AAT GAG CTT GAA GTA 96 

GTT CCT AAA GAA GGC 144 

AGA AAG AAA AAA AAT 192 

GAC TCT GTC ACT GAT 240 

AAA GGT TTT ACC GAA 288 

CCA CAA AAA ATC CAT 336 

GTT GAG GTC AAT GAC 384 

AAT CTT GTT AAC CAA 432 

CCT GCC AGA ACA CAA 480 

ATT GAA GCA GCT CTA 528 

GGC ATT GAT TTC AAG 576 

GCA GCA TAC AAG CAA 624 

AAT CCT GCG GAT GTG 672 

CGA AAA GGT GTC AGC 720 

GCC TAT GGT CGA ACT 768 

AAT GAT GTT GAA GCG 816 

AAA ACT AAT GGA AAA 864 

GCT GTC GTT TTA GGA 912 

AAA GAC TTT GAT GTT 960 

AGT AGA AAA AAC CCA 1008 
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GCT TAT CCT ATT TCA TAC ACC AGT GTT TTC CTT AAA AAT AAT AAA ATT 1056 

GCG GGT GTC AAT AAC AGA ACT GAA TAC GTT GAA ACA ACA TCT ACC GAG 1104 

TAC ACT AGT GGA AAA ATT AAC CTG TCT CAT CAA GGC GCG TAT GTT GCT 1152 

CAA TAT GAA ATC CTT TGG GAT GAA ATC AAT TAT GAT GAC AAA GGA AAA 1200 

GAA GTG ATT ACA AAA CGA CGT TGG GAT AAC AAC TGG TAT AGT AAG ACA 124 8 

TCA CCA TTT AGC ACA GTT ATC CCA CTA GGA GCT AAT TCA CGA AAT ATA 1296 

CGT ATC ATG GCT AGA GAG TGC ACC GGC TTA GCT TGG GAA TGG TGG CGA 1344 

AAA GTG ATC GAC GAA AGA GAT GTG AAA CTG TCT AAA GAA ATC AAT GTC 1392 

AAC ATC TCA GGA TCA ACC CTG AGC CCA TAT GGT TCG ATT ACT TAT AAG 1440 

TAG GAC TGG TTC AAG AGG TTC GTC AAG CAC CTT GAT GCT GCT TAT CTC 1488 

TTG AGA TCC CCG GGT AGG CCT AGT TAA CTA GTC GAC 1524 

(3) INFORMATION FOR SEQ ID NO.: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 amino acids 

(B) TYPE: Amino Acid 

(C) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: protein 
(ix) FEATURE: 

(A) NAME/KEY: Signal sequence 

(B) LOCATION: amino acid 98 to amino acid 571 of SLO 

(C) IDENTIFICATION METHOD: experimentally determined based upon production of soluble, hemolyt- 
ically active SLO from recombinant vector 

(D) OTHER INFORMATION: Lyses red blood cells 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 2 
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Met Asp Pro Ser Glu 

-2 -1 

Glu lie Asn Asp Lys 
15 

Leu Ala Lys Asn Gly 

35 

Val Lys Lys Ala Asp 
50 

lie Asn Thr Thr Pro 
65 

Arg Thr Tyr Pro Ala 
80 

Asn Lys Pro Asp Ala 
95 

lie Asp Leu Pro Gly 

115 

Pro Thr Tyr Ala Asn 
130 

Trp His Asp Asn Tyr 
145 

Tyr Thr Glu Ser Met 
160 



Asp Lys Lys Lys Ser Glu 
5 

lie Tyr Ser Leu Asn Tyr 
20 25 

Glu Thr lie Glu Asn Phe 

40 

Lys Phe lie Val lie Glu 
55 

Val Asp lie Ser lie lie 
70 

Ala Leu Gin Leu Ala Asn 
85 

Val Val Thr Lys Arg Asn 
100 105 

Met Gly Asp Lys Ala Thr 

120 

Val Ser Thr Ala lie Asp 
135 

Ser Gly Gly Asn Thr Leu 
150 

Val Tyr Ser Lys Ser Gin 
165 



Glu Asp His Thr Glu 
10 

Asn Glu Leu Glu Val 

30 

Val Pro Lys Glu Gly 
45 

Arg Lys Lys Lys Asn 

60 

Asp Ser Val Thr Asp 
75 

Lys Gly Phe Thr Glu 
90 

Pro Gin Lys lie His 

110 

Val Glu Val Asn Asp 
125 

Asn Leu Val Asn Gin 
140 

Pro Ala Arg Thr Gin 
155 

lie Glu Ala Ala Leu 
170 
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Asn Val Asn Ser Lys lie Leu Asp Gly Thr Leu Gly lie Asp Phe Lys 
175 180 185 190 

Ser lie Ser Lys Gly Glu Lys Lys Val Met lie Ala Ala Tyr Lys Gin 

195 200 205 

lie Phe Tyr Thr Val Ser Ala Asn Leu Pro Asn Asn Pro Ala Asp Val 
210 215 220 

Phe Asp Lys Ser Val Thr Phe Lys Glu Leu Gin Arg Lys Gly Val Ser 
225 230 235 

Asn Glu Ala Pro Pro Leu Phe Val Ser Asn val Ala Tyr Gly Arg Thr 
240 245 250 

Val Phe Val Lys Leu Glu Thr Ser Ser Lys Ser Asn Asp Val Glu Ala 
255 260 265 270 

Ala Phe Ser Ala Ala Leu Lys Gly Thr Asp Val Lys Thr Asn Gly Lys 

275 280 285 

Tyr Ser Asp lie Leu Glu Asn Ser Ser Phe Thr Ala Val Val Leu Gly 
290 295 300 

Gly Asp Ala Ala Glu His Asn Lys Val Val Thr Lys Asp Phe Asp Val 
305 310 315 

lie Arg Asn Val lie Lys Asp Asn Ala Thr Phe Ser Arg Lys Asn Pro 
320 325 330 

Ala Tyr Pro He Ser Tyr Thr Ser Val Phe Leu Lys Asn Asn Lys He 
335 340 345 350 

Ala Gly Val Asn Asn Arg Thr Glu Tyr Val Glu Thr Thr Ser Thr Glu 

355 360 365 
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Tyr Thr Ser Gly Lys lie Asn Leu Ser His Gin Gly Ala Tyr val Ala 
370 375 380 

Gin Tyr Glu lie Leu Trp Asp Glu lie Asn Tyr Asp Asp Lys Gly Lys 
385 390 395 

Glu Val lie Thr Lys Arg Arg Trp Asp Asn Asn Trp Tyr Ser Lys Thr 
400 405 410 

Ser Pro Phe Ser Thr Val He Pro Leu Gly Ala Asn Ser Arg Asn He 
41 5 420 425 430 

Arg He Met Ala Arg Glu Cys Thr Gly Leu Ala Trp Glu Trp Trp Arg 

435 440 445 

Lys Val He Asp Glu Arg Asp Val Lys Leu Ser Lys Glu He Asn Val 
450 455 460 

Asn lie Ser Gly Ser Thr Leu Ser Pro Tyr Gly Ser He Thr Tyr Lys 
465 470 475 

Claims 

1. A purified and isolated DNA sequence encoding a soluble fusion product of Streptolysin O evidencing hemolytic 
activity. 

2. The DNA sequence of Claim 1 , wherein said DNA sequence is as set forth in SEQ ID No 1 . 

3. A prokaryotic or eukaryotic host cell transformed or transfected with a DNA sequence according to claim 2 in a 
manner allowing the host cell to express a soluble, hemolytically active fusion product of Streptolysin O. 

4. A DNA plasmid comprising a DNA vector and DNA sequence according to claim 2. 

5. An amino acid sequence encoding a soluble, hemolytically active fusion product of Streptolysin O, wherein said 
amino acid sequence is as set forth in SEQ ID No 2. 

6. The DNA plasmid of claim 4, wherein the DNA vector comprises a promoter gene. 
Patentanspriiche 

1. Gereinigte und isolierte DNA-Sequenz, welche ein losliches Fusionsprodukt von Streptolysin-O kodiert, das ha- 
molytische Aktivitat aufweist. 

2. DNA-Sequenz aus Anspruch 1, wobei die genannte DNA-Sequenz die in SEQ ID Nr. 1 dargelegte Sequenz ist. 



21 



EP 0 555 439 B1 



3. Mit einer DNA-Sequenz nach Anspruch 2 derart transformierte Oder transfizierte prokaryontische Oder eukaryon- 
tische Wirtszelle, daR die Wirtszelle ein losliches, hamolytisch aktives Fusionsprodukt von Streptolysin-O expri- 
mieren kann. 

4. DNA-Plasmid, welches einen DNA-Vektor und eine DNA-Sequenz gemaB Anspruch 2 umfaBt. 

5. Aminosaure-Sequenz, welche fur ein losliches, hamolytisch aktives Fusionsprodukt von Streptolysin-O kodiert, 
wobei die genannte Aminosaure-Sequenz die in SEQ ID Nr. 2 angegebene Sequenz ist. 

6. DNA-Plasmid nach Anspruch 4, bei welchem der DNA-Vektor ein Promotor-Gen umfaRt. 



Revendications 

1. Sequence d'ADN purifiee et isolee codant pour un produit de fusion soluble de la Streptolysine O mettant en 
evidence une activite hemolytique. 

2. Sequence d'ADN de la revendication 1 ou ladite sequence d'ADN est telle qu'indiquee dans SEQ ID N° 1 . 

3. Cellule note procaryote ou eucaryote transformed ou transfectee par une sequence d'ADN selon la revendication 
2 d'une maniere permettant a la cellule note d'exprimer un produit de fusion soluble, hemolytiquement actif de la 
Streptolysine O. 

4. ADN plasmidique comprenant un vecteur d'ADN et une sequence d'ADN selon la revendication 2. 

5. Sequence d'acides amines codant pour un produit de fusion soluble, hemolytiquement actif de la Streptolysine O 
ou ladite sequence d'acides amines est telle qu'indiquee dans SEQ ID N° 2. 

6. ADN plasmidique de la revendication 4 ou le vecteur d'ADN comprend un gene promoteur. 
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ATG GAT CCG TCA GAA GAG AAA AAA AAG AGC GAA GAA GAT CAC ACT GAA 4 8 

GAA ATC AAT GAC AAG ATT TAT TCA CTA AAT TAT AAT GAG CTT GAA GTA 9 6 

CTT GCT AAA AAT GGT GAA ACC ATT GAA AAT TTT GTT CCT AAA GAA GGC 14 4 

GTT AAG AAA GCT GAT AAA TTT ATT GTC ATT GAA AGA AAG AAA AAA AAT 19 2 

ATC AAC ACT ACA CCA GTC GAT ATT TCC ATC ATT GAC TCT GTC ACT GAT 24 0 

AGG ACC TAT CCA * GCA GCC CTT CAG CTG GCT AAT AAA GGT TTT ACC GAA 288 

AAC AAA CCA GAC GCG GTA GTC ACC AAG CGA AAC CCA CAA AAA ATC CAT 33 6 

ATT GAT TTA CCA GGT ATG GGA GAC AAA GCA ACG GTT GAG GTC AAT GAC 3 84 

CCT ACC TAT GCC AAT GTT TCA ACA GCT ATT GAT AAT CTT GTT AAC CAA 43 2 

TGG CAT GAT AAT TAT TCT GGT GGT AAT ACG CTT CCT GCC AGA ACA CAA 4 80 

TAT ACT GAA TCA ATG GTA TAT TCT AAG TCA CAG ATT GAA GCA GCT CTA 52 8 

AAT GTT AAT AGC AAA ATC TTA GAT GGT ACT TTA GGC ATT GAT TTC AAG 57 6 

TCG ATT TCA AAA GGT GAA AAG AAG GTG ATG ATT GCA GCA TAC AAG CAA 62 4 

ATT TTT TAC ACC GTA TCA GCA AAC CTT CCT AAT AAT CCT GCG GAT GTG 672 

TTT GAT AAA TCA GTG ACC TTT AAA GAG TTG CAA CGA AAA GGT GTC AGC 720 

AAT GAA GCT CCG CCA CTC TTT GTG AGT AAC GTA GCC TAT GGT CGA ACT 768 

GTT TTT GTC AAA CTA GAA ACA AGT TCT AAA AGT AAT GAT GTT GAA GCG 816 

GCC TTT AGT GCA GCT CTA AAA GGA ACA GAT GTT AAA ACT AAT GGA AAA 864 

TAC TCT GAT ATC TTA GAA AAT AGC TCA TTT ACA GCT GTC GTT TTA GGA 912 

GGA GAT GCT GCA GAG CAC AAT AAG GTA GTC ACA AA/\ GAC TTT GAT GTT 9 60 

ATT AGA AAC GTT ATC AAA GAC AAT GCT ACC TTG AGT AGA AAA AAC CCA 100 8 

GCT TAT CCT ATT TCA TAC ACC AGT GTT TTC CTT AAA AAT AAT AAA ATT 105 6 

GCG GGT GTC AAT AAC AGA ACT GAA : TAC GTT GAA ACA ACA TCT ACC GAG 1104 

TAC ACT AGT GGA AAA ATT AAC CTG TCT CAT CAA GGC GCG TAT GTT GCT 1152 

CAA TAT GAA ATC CTT TGG GAT GAA ATC AAT TAT GAT GAC AAA GGA AAA 1200 



Fig. 1 
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GAA GTG ATT ACA AAA CGA CGT TGG GAT AAC AAC TGG TAT AGT AAG ACA 12 4 8 

TCA CCA TTT AGC ACA GTT ATC CCA CTA GGA GCT AAT TCA CGA AAT ATA 12 9 6 

CGT ATC ATG GCT AGA GAG TGC ACC GGC TTA GCT TGG GAA TGG TGG CGA 1344 

AAA GTG ATC GAC GAA AGA GAT GTG AAA CTG TCT AAA GAA ATC AAT GTC 13 9 2 

AAC ATC TCA GGA TCA ACC CTG AGC CCA TAT GGT TCG ATT ACT TAT AAG 144 0 

TAG GAC TGG TTC AAG AGG TTC GTC AAG CAC CTT GAT GCT GCT TAT CTC 14 8 8 

TTG AGA TCC CCG GGT AGG CCT AGT TAA CTA GTC GAC 1524 



Fig. 1 
(Continued) 
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Met Asp Pro Ser Glu Asp Lys Lys Lys Ser Glu Glu Asp His Thr Glu 
-2-1 '5 10 

Glu lie Asn Asp Lys lie Tyr Ser Leu Asn Tyr Asn Glu Leu Glu Val 

15 2 0 2 5 3 0 

Leu -Ala Lys Asn Gly Glu Thr He Glu Asn Phe Val Pro Lys Glu Gly 

35 40 45 

Val Lys Lys Ala Asd Lys Phe He Val He Glu Arg Lys Lys Lys Asn 
50 " 55 60 

He Asn Thr Thr Pro Val Asp He Ser He He Asp Ser Val Thr Asp 
65 70 75 

Arg Thr Tyr Pro Ala Ala Leu Gin Leu Ala Asn Lys Gly Phe Thr Glu 
80 85 90 

Asn Lys Pro Asp Ala Val Val Thr Lys Arg Asn Pro Gin Lys He His 
95 ' ~ 100 105 - 110 

lie Asp Leu Pro Gly Met Gly Asp Lys Ala Thr Val Glu Val Asn Asp 

115 120 125 

Pro Thr Tyr Ala Asn Val Ser Thr Ala He Asd Asn Leu Val Asn Gin 
130 135 . " 140 

Trp His Asp Asn Tyr Ser Gly Gly Asn Thr Leu Pro Ala Arg Thr Gin 
145 * 150 155 

Tyr Thr Glu Ser Met Val Tyr Ser Lys Ser Gin He Glu Ala Ala Leu 
160 165 170 

Asn Val Asn Ser Lys He Leu Asp Gly Thr Leu Gly He Asd Phe Lvs 
175 " 130 * 185 " 19*0 

Ser He Ser Lys Gly Glu Lys Lvs Val Met He Ala Ala Tyr Lys Gin 

195 200 " 205 

He Phe Tyr Thr Val Ser Ala Asn Leu Pro Asn Asn Pro Ala Asp Val 
210 215 220 

Phe Asp Lys Ser Val Thr Phe Lys Glu Leu Gin Arg Lys Gly Val Ser 
225 230 235 

Asn Glu Ala Pro Pro Leu Phe Val Ser Asn Val Ala Tyr Gly Arg Thr 
240 245 250 



Fig. 2 
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EP 0 555 439 B1 



Val Phe Val Lys Leu Glu Thr Ser Ser Lys Ser Asn Asp Val Glu Ala 
255 260 265 290 

Ala Phe Ser Ala Ala Leu Lys Gly Thr Asp Val Lys Thr Asn Gly Lys 

275 280 285 

Tyr Ser Asp lie Leu Glu Asn Ser Ser Phe Thr Ala Val Val Leu Gly 
290 295 300 

Gly Asp Ala Ala Glu His Asn Lys. Val Val Thr Lys Asp Phe Asp Val 
305 310 315 

lie Arg Asn Val lie Lys Asp Asn Ala Thr Phe Ser Arg Lys Asn Pro 
320 325 330 

Ala Tyr Pro lie Ser Tyr Thr Ser Val Phe Leu Lys Asn Asn Lys lie 
335 340 345 350 

Ala Gly Val Asn Asn Arg Thr Glu Tyr Val Glu Thr Thr Ser Thr Glu 

355 360 365 

Tyr Thr Ser Gly Lys lie Asn Leu Ser His Gin Gly Ala Tyr Val Ala 
370 375 * 380 

Gin Tyr Glu He Leu Trp Asp Glu He A.sn Tyr Asp Asp Lys Gly Lys 
385 " 390 395 

Glu Val He Thr Lys Arg Arg Trp Asp Asn Asn Trp Tyr Ser Lys Thr 
400 405 410 

Ser Pro Phe Ser Thr Val He Pro Leu Gly Ala Asn Ser Arg Asn lie 
415 420 425 ~* 430 

Arg He Met Ala Arg Glu Cys Thr Gly Leu Ala Trp Glu Trp Tro Arg 
■ " 435 440 " 445 

Lys Val lie Asp Glu Arg Asp val Lys Leu Ser Lys Glu lie Asn Val 
450 455 460 

Asn He Ser Gly Ser Thr Leu Ser Pro Tyr. Gly Ser He Thr Tyr Lys 
465 470 - 475 



Fig. 2 
(Continued) 
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